System and method for triggering analysis of an object for malware in response to modification of that object

ABSTRACT

According to one embodiment, a system featuring one or more processors and memory that includes monitoring logic. During operation, the monitoring logic is configured to monitor for and detect a notification message that is directed to a destination other than the monitoring logic and identify an event associated with a change in state of a data store associated with the file system to occur. The notification message, at least in part, triggers a malware analysis to be conducted on an object associated with the state change event.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation of U.S. application Ser. No. 14/985,287, filed on Dec. 30, 2015, now U.S. Pat. No. 10,133,866, issued Nov. 20, 2018, the entire contents of this application is incorporated by reference herein.

FIELD

Embodiments of the disclosure relate to cyber security. More particularly, embodiments of the disclosure are related to a system and method for triggered analysis of an object for a presence of malware based on an event detected at a file system.

GENERAL BACKGROUND

Over the last decade, file sharing systems that are accessible over the Internet or other publicly accessible networks have been increasingly targeted for malicious attack. One type of malicious attack may involve an attempt, normally through unsuspected uploading of malicious data (e.g., software, data, command(s), etc.) within content stored within a file sharing system, to infect any or all computers that upload the content. The malicious data, generally referred to as “malware,” may allow a third party to adversely influence or attack normal operations of the computer where the malicious attack is directed to a vulnerability associated with a specific application (e.g., browser application, document reader application, data processing application, etc.).

For instance, it is recognized that the malicious data may include a program or file that is harmful by design to the computing device. The malicious data may include computer viruses, worms, or any other executable (binary) that gathers or attempts to steal information from the computer, or otherwise operates without permission. The owners of the computers are often unaware that the malicious data has been added to their computers and is in operation.

Various processes and devices have been employed to prevent malicious attacks and other security threats on a file sharing system. Previously, security appliances were placed in-line with a storage server in an attempt to detect malware, in the form of an exploit or some sort of malicious software, as it is being routed into the storage server. However, for that deployment, conventional security appliances were required to understand and process packets configured in accordance with a storage protocol supported by a file system utilized by the storage server, where file system storage protocols are highly divergent. In fact, different types of file system may support different storage protocols and even different storage protocols may be used on different versions of the same type of file system. Additionally, the conventional in-line security appliances caused latency in the retrieval of files or other documents from the storage server. This latency adversely influenced the overall user experience provided by the file sharing system.

In fact, a security appliance offered by FireEye, Inc., the assignee of the present patent application, employs a two-phase malware detection approach to analyze files stored on a file system. This security appliance typically runs an analysis by traversing a storage tree to identify files to scan, and comparing the time of the last scan with the last modification of the file to reduce overhead by limiting its analysis to avoid repeating the scans of files not modified since the prior scanning period. It is noted that the complexity of this type of security appliance greatly increases as the storage volumes increase and storage protocols utilized by the file systems change.

BRIEF DESCRIPTION OF THE DRAWINGS

Embodiments of the disclosure are illustrated by way of example and not by way of limitation in the figures of the accompanying drawings, in which like references indicate similar elements and in which:

FIG. 1 is an exemplary block diagram of a physical representation of an enterprise network including a storage server that is part of a file system in communication with a threat detection system.

FIG. 2 is an exemplary embodiment of the storage server of FIG. 1.

FIG. 3 is an exemplary block diagram of a logical representation of the storage server of FIG. 2.

FIG. 4 is an exemplary embodiment of the threat detection system that communicates with monitoring logic deployed within a file system of the storage server of FIG. 2.

FIG. 5 is an exemplary embodiment of operations conducted during the inter-connectivity between the monitoring logic and the threat detection system of FIGS. 2-4.

DETAILED DESCRIPTION

Various embodiments of the disclosure are directed to a threat detection system that takes advantage of notification messages that are issued by many types of conventional storage systems, such as file system or a database system for example, in response to a state change event. A state change event represents a detected activity that caused a change in state of a data store within the storage system. As an example, the state change event can occur in response to a requested modification of a stored object (e.g., file, document, etc.) such as a file being updated and rewritten to the data store. As another example, the state change event can occur in response to a request to store an object into the data store such as a new file being stored within the data store where subsequent retrieval and modification of the new file is controlled by the storage system. For convenience, the storage system may be described in terms of a file system within the description that follows, but the scope of the claims is not necessarily so limited.

According to one embodiment of the disclosure, monitoring logic may be configured to monitor for (sometimes referred to as “hook”) signaling issued in response by storage control logic, which may be part of the file system that controls storage and retrieval of objects within a storage server. The signaling may include a notification message that identifies an occurrence of a state change event, where the notification message occurs after completion of the change of state (e.g., file operation). However, it is contemplated that the notification message may occur prior to completion of the change of state, provided that the storage of the object is completed before malware detection analysis is conducted. For instance, the notification message may be prompted in response to receipt of a request message directed to a kernel mode of the storage server to change the state of the object (e.g., add, delete or modify the object in the data store) and/or a response message from the kernel mode of the storage system to indicate that the requested state change has been completed.

Being configured to interact with an Application Programming Interface (API) provided by the storage (file) system, the monitoring logic is able to monitor, using the API, for one or more notification messages that are responsive to particular state change events. In response to detecting a notification message, the monitoring logic extracts an identifier of the object upon which a state change event has occurred (hereinafter referred to as the “suspect object”). The identifier provides information that specifies a unique location of the suspect object within the storage system, where the monitoring logic passes the identifier of the suspect object (and/or additional data associated with the identifier) to the threat detection system. The threat detection system uses the receipt of the identifier as a trigger to obtain the suspect object from the storage (file) system and analyze the suspect object to confirm that the suspect object is free of malware.

Herein, according to one embodiment of the disclosure, the identifier of the suspect object (hereinafter “object identifier”) may include a file path (e.g., a pointer to a storage location of the suspect object within the data store of the storage server as assigned by the file system). It is contemplated that, according to this embodiment, the object identifier may be represented as a string of characters separated by a delimiting character (e.g. “/”) that represent a directory tree hierarchy as organized by the file system. According to another embodiment of the disclosure, the object identifier may include a unique name assigned to the suspect object (e.g., file name) by the file system.

Stated differently, the malware analysis conducted by the threat detection system may be triggered by receipt of a notification message in response to a state change event (e.g., adding, deleting or modifying a stored object). The notification message may be intercepted, trapped, or monitored, sometimes referred to as “hooked”, by the monitoring logic. Upon detecting the notification message (or portions thereof), the monitoring logic identifies the suspect object, namely the object being added, deleted or modified within the storage system. Such identification may be performed by the monitoring logic extracting metadata from the notification message, where the metadata may include the object identifier that identifies a location of the suspect object in the storage server. Representing a location of the suspect object, the object identifier may be in the form of a path to a storage location of the suspect object, name of the suspect object, or the like. The object identifier is provided to the threat protection system that, after receipt, may fetch the object and conduct a static analysis and/or behavioral (dynamic) analysis on the suspect object to determine whether the suspect object is malicious. This static analysis may be conducted by comparing characteristics of the suspect object to known malicious objects (black list) or known benign objects (white list) while the behavioral analysis may be conducted by processing the suspect object, accessed via the object identifier, and determining whether the processing of the suspect object causes any anomalous (unexpected) behaviors to occur.

Furthermore, the monitoring logic may provide information to the threat detection system to initiate a comprehensive alert message that notifies an administrator of the details (e.g., storage location, IP address, object name, etc.) of an object under analysis that is currently stored in the data store of the storage server. Furthermore, the monitoring logic provides a communication path between the storage system and the threat detection system so that, in response to classifying the object under analysis as malicious, the object is removed from the data store or quarantined. Additionally, or in the alternative, the storage (file) system may be configured to substitute the suspect object deemed to be malicious by the threat detection system with a placeholder object (e.g., a text file, etc.). When the placeholder object is subsequently accessed by the electronic device, the placeholder object causes the electronic device processing the placeholder object to generate a notification that warns the user of removal and/or quarantine of the subject object. The warning may include more details about the suspect object (e.g., information regarding the malware type present in the suspect object, contact information for an administrator of the storage system, etc.).

In light of the foregoing, the behavioral analysis of the suspect object is based on and responsive to the “hooked” notification message that is already being issued by the storage control logic that is part of the file system when the suspect object undergoes a change in (storage) state. Additionally, the behavioral analysis may be performed out-of-band instead of on the same path as the request message from the electronic device that is seeking access to the suspect object.

I. Terminology

In the following description, certain terminology is used to describe various features of the invention. For example, the terms “logic,” and “engine” may be representative of hardware, firmware or software that is configured to perform one or more functions. As hardware, logic (or engine) may include circuitry having data processing or storage functionality. Examples of such circuitry may include, but are not limited or restricted to a hardware processor (e.g., microprocessor with one or more processor cores, a digital signal processor, any type of programmable gate array, a microcontroller, an application specific integrated circuit “ASIC”, etc.), a semiconductor memory, or combinatorial elements.

Logic (or engine) may be software such as one or more processes, one or more instances, Application Programming Interface(s) (API), subroutine(s), function(s), applet(s), servlet(s), routine(s), source code, object code, shared library/dynamic link library (dll), or even one or more instructions. This software may be stored in any type of a suitable non-transitory storage medium, or transitory storage medium (e.g., electrical, optical, acoustical or other form of propagated signals such as carrier waves, infrared signals, or digital signals). Examples of non-transitory storage medium may include, but are not limited or restricted to a programmable circuit; non-persistent storage such as volatile memory (e.g., any type of random access memory “RAM”); or persistent storage such as non-volatile memory (e.g., read-only memory “ROM”, power-backed RAM, flash memory, phase-change memory, etc.), a solid-state drive, hard disk drive, an optical disc drive, or a portable memory device. As firmware, the logic (or engine/component) may be stored in persistent storage.

The term “object” generally relates to a collection of data, whether in transit (e.g., over a network) or at rest (e.g., stored), often having a logical structure or organization that enables the object to be classified for purposes of analysis for malware. Examples of different types of objects may include a self-contained file that is separate from or is part of a flow. A “flow” generally refers to related packets that are received, transmitted, or exchanged within a communication session. For convenience, a packet broadly refers to a series of bits or bytes having a prescribed format. The object may correspond to a non-executable or executable file. Examples of a non-executable file may include a document (e.g., a Portable Document Format “PDF” document, word processing document such as Microsoft® Office® document, Microsoft® Excel® spreadsheet, etc.), a downloaded web page, a collection of documents (e.g., a compressed file including two or more documents), or the like. An executable file may be a program that may be made available to an operating system (OS) or an application within the storage server, where an out of the program may be received by a number of electronic devices.

The term “message” generally refers to information placed in a prescribed format. Each message may be in the form of one or more packets, frames, HTTP-based transmissions, a Short Message Service (SMS) text, a Simple Mail Transfer Protocol (SMTP) transmission, or any other series of bits having the prescribed format.

The term “network device” should be generally construed as electronics with data processing capability and/or a capability of connecting to any type of network, such as a public network (e.g., Internet), a private network (e.g., a local area network “LAN”, wireless LAN, etc.), or a combination of networks. Examples of a network device may include, but are not limited or restricted to, the following: a security appliance that includes any system or subsystem configured to perform functions associated with malware detection on an incoming object; a server, a mainframe, firewall, a router; or an endpoint device (e.g., a laptop, a smartphone, a tablet, a desktop computer, a netbook, a medical device, or any general-purpose or special-purpose, user-controlled network device).

According to one embodiment, the term “malware” may be construed broadly as any code or activity that initiates a malicious attack and/or operations associated with anomalous or unwanted behavior. For instance, malware may correspond to a type of malicious computer code that executes an exploit to take advantage of a vulnerability, for example, to harm or co-opt operation of a network device or misappropriate, modify or delete data. Malware may also correspond to an exploit, namely information (e.g., executable code, data, command(s), etc.) that attempts to take advantage of a vulnerability in software and/or an action by a person gaining unauthorized access to one or more areas of a network device to cause the network device to experience undesirable or anomalous behaviors. The undesirable or anomalous behaviors may include a communication-based anomaly or an execution-based anomaly, which, for example, could (1) alter the functionality of a network device executing application software in an atypical manner (a file is opened by a first process where the file is configured to be opened by a second process and not the first process); (2) alter the functionality of the network device executing that application software without any malicious intent; and/or (3) provide unwanted functionality which may be generally acceptable in another context. Additionally, malware may be code that initiates unwanted behavior which may be, as one example, uploading a contact list from an endpoint device to cloud storage without receiving permission from the user.

The term “interconnect” may be construed as a physical or logical communication path between two or more network devices or between different logic (engine/components). For instance, a physical communication path may include wired or wireless transmission mediums. Examples of wired transmission mediums and wireless transmission mediums may include electrical wiring, optical fiber, cable, bus trace, a radio unit that supports radio frequency (RF) signaling, or any other wired/wireless signal transfer mechanism. A logical communication path may include an inter-process communication (IPC) mechanism or other communication mechanism that allows for signaling between different logic.

The term “computerized” generally represents that any corresponding operations are conducted by hardware in combination with software or firmware.

Lastly, the terms “or” and “and/or” as used herein are to be interpreted as inclusive or meaning any one or any combination. Therefore, “A, B or C” or “A, B and/or C” mean “any of the following: A; B; C; A and B; A and C; B and C; A, B and C.” An exception to this definition will occur only when a combination of elements, functions, steps or acts are in some way inherently mutually exclusive.

II. General System Architecture

Referring to FIG. 1, an exemplary block diagram of a physical representation of an enterprise network 100 that features a storage server 120, which provides a storage system 130 for controlling the storage and retrieval of objects (e.g., non-executable files). As shown, the enterprise network 100 comprises a network 110 including one or more interconnects that provide connectivity between the storage server 120, one or more network devices 140 ₁-140 _(N) (N≥1), and a threat detection system 150.

Herein, the storage system 130 may include, but is not limited or restricted to a file system that allows each of the network devices 140 ₁-140 _(N) to store and/or access one or more objects (e.g. files) stored in the storage server 120. Communicatively coupled to or integrated within a portion of the file system 130, monitoring logic 160 is configured to monitor for selected state change events, and in particular a notification message 170 associated with any of the selected state change events. The notification message 170 indicates a change in state of a suspect object 180 that is resides in or is targeted for storage in a data store 185 of the storage server 120 (e.g., modification of a stored file within the data store 185, adding a file for storage within the data store 185 of the storage server 120, deleting a file from the data store 185 of the storage server 120, etc.). According to one embodiment, the monitoring logic 160 may be a plug-in communicatively coupled to an Application Programming Interface (API) associated with the portion of the storage (file) system 130. Of course, it is contemplated that the monitoring logic 160 may be a software component that is different than a plug-in or such functionality may be integrated as part of the file system 130.

In response to detecting the notification message 170, the monitoring logic 160 routes some or all of the data within notification message 170, most notably metadata 190 for identifying a storage location of the suspect object 180 within the storage server 120, to the threat detection system 150. In response to receipt of the metadata 190 (sometimes referred to as the “object identifier” 190), the threat detection system 150 may access the suspect object 180 and conduct static and/or behavioral (dynamic) analysis on some or all of the suspect object 180 to determine whether or not the suspect object 180 has a probability of being associated with a malicious attack that exceeds a prescribed level of probability (e.g., greater than 50%, 70%, 80%, or 90%, etc.).

In the event that the suspect object 180 is determined by the threat detection system 150 to be malicious, the threat detection system 150 may initiate an alert message to an administrator as described below. Furthermore, the threat detection system 150 may return a message 195 to the monitoring logic 160 that identifies the suspect object 180 is malicious. This may cause the storage (e.g., file) system 130 to initiate an operation to remove the suspect object 180 from the data store 185 of the storage server 120 or re-locate the suspect object 180 within the data store 185. Optionally, although not shown, logic within the storage (e.g., file) system 130 may substitute the suspect object 180 with a text object (e.g., file) (not shown) operating as a placeholder. The text file may cause display of a message on a display screen of a network device (e.g., network device 140 ₁) to identify that the suspect object 180 has been quarantined or removed, and in some cases, information that allows the entity attempting to access the suspect object 180 to contact an administrator.

Referring now to FIG. 2, an exemplary block diagram of a physical representation of a network device (e.g., storage server 120 of FIG. 1) is shown. The storage server 120 is configured with a storage (e.g., file) system 130 that controls storage and retrieval of one or more object(s) 200 (e.g., files), including the suspect object 180 of FIG. 1, within a non-transitory storage medium 220. Herein, the storage server 120 comprises one or more hardware processors (referred to as “processor(s)”) 210, the non-transitory storage medium 220, one or more network interfaces (referred to as “network interface(s)”) 230, and one or more network devices (referred to as “network device(s)”) 240 connected by a system interconnect 250, such as a bus. These components are at least partially encased in a housing 260, which is made entirely or partially of a rigid material (e.g., hardened plastic, metal, glass, composite, or any combination thereof) that protects these components from environmental conditions.

The processor(s) 210 is a multipurpose, programmable component that is configured to accept digital data as input and process the input data in accordance with stored instructions. The input data may include a storage access request message (e.g., file write request, file create request, file delete request, etc.) from an endpoint device controlled by a user. One example of a processor may include an Intel® x86 central processing unit (CPU) with an instruction set architecture. Alternatively, a processor may include another type of CPU, a digital signal processor (DSP), an Application Specific Integrated Circuit (ASIC), a field-programmable gate array (FPGA), or other logic with data processing capability. The processor(s) 210 and the file system 130 within the non-transitory storage medium 220 collectively operate as a system resource that allows for storage and subsequent retrieval of one or more objects 200, such as a non-executable file for example, remotely from the endpoint device (e.g., network device 140 ₁ of FIG. 1).

The network device(s) 240 may include various input/output (I/O) or peripheral devices, such as a keyboard, key pad, touch screen, or mouse for example. Each network interface 230 may include one or more network ports containing the mechanical, electrical and/or signaling circuitry needed to connect the storage server 120 to network 110 of FIG. 1 (or optionally a network interface card “NIC”) thereby facilitate communications to other remotely located network devices 140 ₁-140 _(N), as shown in FIG. 1. Hence, the network interface(s) 230 may be configured to transmit and/or receive access request messages from network devices 140 ₁, . . . or, 140 _(N) using a variety of communication protocols including, inter alia, Transmission Control Protocol/Internet Protocol (TCP/IP), Hypertext Transfer Protocol (HTTP), HTTP Secure (HTTPS), SMS, iMessage, or the like.

The non-transitory storage medium 220 operates as the data store. From a logical perspective, the non-transitory storage medium 220 includes a plurality of locations that are addressable by the processor(s) 210 and the network interface(s) 230 for storing logic, including monitoring logic 160 communicatively coupled to storage control logic 270 of the file system 130. As deployed, the storage control logic 270 controls the storage and retrieval of object(s) 200 from the non-transitory storage medium 220. The monitoring logic 160 is configured to monitor for signaling that identifies a state change in stored content within the non-transitory storage medium 220, where such signaling includes the notification message 170 that identifies the object that is being added to, deleted from or modified within the storage server 120.

According to one embodiment, as further shown in FIG. 2, the monitoring logic 160 may be a plug-in that is configured to detect the notification message 170 initiated by the storage control logic 270. Herein, the monitoring logic 160 may be implemented as part of the storage (file) system 130 to detect notification messages 170 that is responsive to certain types of state change events conducted on a stored object 200 that is being monitored or may be configured to detect certain selected notification messages via an Application Programming Interface (API) 280, as shown. The API 280 provides the monitoring logic 160 with accessibility to the storage control logic 270 within the kernel mode of the storage server 120 to detect certain changes to the stored object(s) that are initiated by a particular request message from any of network devices 140 ₁-140 _(N) of FIG. 1 being monitored. For instance, the request message may include a write request message (e.g., File Write request) received by the file system 130 from network devices 140 ₁ of FIG. 1 to update (modify and subsequent storage of) one of the stored object(s) 200. Another request message may include an object addition request message (e.g., Create File request) to add an object (e.g., a non-executable file) for storage within the data store 185 of the storage server 120.

In response to detecting a particular notification message 170 being monitored, the monitoring logic 160 extracts metadata that identifies the object undergoing a state change (herein, “object identifier”), which may identify a file currently stored on the storage server 120 that is being updated or a new file currently being written to (i.e., stored on) the storage server 120. Thereafter, the monitoring logic 160 operates in connection with the network interface(s) 230 to transmit the object identifier to the threat detection system 150 of FIG. 1, where the object identifier may be used by the threat detection system 150 to retrieve the object for conducting behavioral analysis on that object to determine whether the object is associated with a malicious attack. Alternatively, the object identifier may be provided with a copy of that object to the threat detection system 150 in lieu of the “pull” (fetch) mechanism generally described previously.

As shown in FIG. 3, an exemplary block diagram of a logical representation of the storage server 120 of FIG. 2 is shown. Herein, the storage server 120 comprises a user mode 300 and a kernel mode 330. Herein, in user mode 300, an application or other executing code is unable to directly access hardware or reference memory. Rather, the application or other executing code accesses hardware or memory via a system API. In kernel mode, however, a driver or other executing code may have complete and unrestricted access to the underlying hardware. Hence, kernel mode 330 is generally reserved for the lowest-level (highest privileged), most trusted file system functionality such as the storage control logic 270 of FIG. 2. Access by monitoring logic 160 within the user mode 300 to notification messages issued by the storage control logic 270 in response to state change events is provided through the API 280.

Herein, a request message 310 for accessing an object or request an update or storage of the object within the storage server 120 is provided from a kernel mode of the network device accessible to the user (e.g., network device 140 ₁ of FIG. 1) to storage control logic 270 of the file system situated within the kernel mode 330 of the storage server. The storage control logic 270 performs a state change event (e.g., modifies the file through a write access), and issues the notification message 170 as a return message for the access request message. The presence of the notification message 170 may be detected by the monitoring logic 160 via the API 280. In response, the monitoring logic 160 extracts the object identifier associated with the object undergoing a state change and provides the object identifier to the threat detection system 150 of FIG. 1, which may be used by the threat detection system 150 to retrieve the object.

III. Threat Detection System Architecture

Referring to FIG. 4, an exemplary embodiment of the threat detection system 150 that communicates with monitoring logic (e.g., a plug-in) deployed within a file system of the storage server 120 is shown. The threat detection system 150 is adapted to analyze the suspect object 180 associated with file that is newly stored, deleted or stored and modified within the file system 130. According to this illustrative embodiment, the threat detection system 150 may be communicatively coupled with the network 110 via interface logic 410, where the network 110 may operate as a public network such as the Internet or a private network (e.g., a local area network “LAN”, wireless LAN, etc.). The interface logic 410 is configured to receive some or all of the data within a detected notification message, most notably the metadata 190 that identifies a storage location of the suspect object 180 within the storage server 120, which is routed to the threat detection system 150. For instance, as an illustrative example, the interface logic 410 may be a data capturing device that automatically (or on command) accesses data stored in a storage system or another type of interface, such as a port, for receiving objects manually provided via a suitable dedicated communication link or from storage media such as a solid-state drive or flash drive.

As shown in FIG. 4, the interface logic 410 operates as a data capturing device that receives incoming data 424, namely the metadata (object identifier) 190 and/or the suspect object 180. Alternatively, the interface logic 410 can be integrated into an intermediary device in the communication path (e.g., an optional firewall, router, switch or other networked electronic device) or may be deployed as a standalone component, such as an appropriate commercially available network tap, as shown.

According to one embodiment of the disclosure, the metadata 190 may be used, at least in part by interface logic 410 when the suspect object 180 is not part of the incoming data 424, to determine if the object identifier, which identifies a storage location of the suspect object 180 in the storage server 120, is provided with the metadata 190. If so, the interface logic 410 initiates communications to fetch the suspect object 180 from the storage server 120. It is contemplated that the metadata 190 may be further used to determine protocols, application types and other information, which may be used by logic within the threat detection system 200 such as a scheduler 435 or other logic such as a virtual machine monitor (not shown) for example, to determine a particular software profile used for virtual machine (VM) configuration and/or VM operation scheduling. As an example, one or more software profiles may be used for initial configuration of guest software of one or more VMs 460 ₁-460 _(M) (M≥1) operating within dynamic analysis system 450. Fetched from a storage device 440, these software profile(s) may be directed to different types of applications (e.g., different versions of the same application type, different application types, etc.).

As further shown in FIG. 4, the threat detection system 150 includes the interface logic 410, the static analysis system 430, the scheduler 435, the storage device 440, the dynamic analysis system 450, classification engine 480, and/or reporting engine 485. Herein, according to this embodiment of the disclosure, the interface logic 410 receives data associated with the notification message, including an object identifier. In response to receipt of the object identifier, the interface logic 410 issues a request for the suspect object identified by the object identifier.

In response to receipt of the suspect object 180, the interface logic 410 may be configured to convert that object 180 into a format, if needed or as appropriate, on which scanning may be conducted by the static analysis system 430. This conversion may involve decompression of the object for example. It is contemplated that the interface logic 410 may conduct de-compilation, disassembly or other de-obfuscation activities on the captured object 424 to produce a formatted object 426. However, as shown below, the de-obfuscation and data extraction activities may be handled by logic within the static analysis system 430.

Referring still to FIG. 4, the static analysis system 430 may analyze information associated with the formatted object 426. Such analysis may include, but is not limited or restricted to, an analysis of the object type and may extract one or more characteristics (hereinafter “characteristic(s)”) associated with the formatted object 426, such as the object name, object type, size, path, or the like. According to this embodiment of the disclosure, the extracted characteristic(s) may be provided as static analysis (SA)-based results 470 to the classification engine 480 for subsequent analysis. Additionally or in the alternative, the static analysis system 430 may analyze the formatted object 426 itself by performing one or more checks. An example of the check may include one or more signature checks, which may involve a comparison of (i) content of the formatted object 426 and (ii) one or more pre-stored signatures associated with detected malware.

It is contemplated that the static analysis system 430 may further include processing circuitry (not shown) that is responsible for extracting or generating metadata contained within or otherwise associated with formatted object 426 from the interface logic 410. This metadata may be subsequently used by the scheduler 435 for initial configuration of one or more VMs 460 ₁-460 _(M) within the dynamic analysis system 450, which conducts run-time processing of at least a portion of the formatted object 426 as described below.

Although not shown, for a multiple VM deployment, a first VM 460 ₁ and a second VM 460 ₂ may be configured to run concurrently (i.e. at the same time or in an overlapping manner), where each of these VMs may be initially configured with different software profiles. As an alternative embodiment, the first VM 460 ₁ may be configured to run multiple processes involving a single type of application instance or multiple types of application instances concurrently or sequentially.

More specifically, after analysis of the formatted object 426 has been completed, the static analysis system 430 may provide at least a portion of the formatted object 426 (hereinafter generally referred to as “suspicious object” 428) to the dynamic analysis system 450 for in-depth dynamic analysis by the VMs 460 ₁-460 _(M). For instance, according to one embodiment of the disclosure, a first VM 460 ₁ may be adapted to analyze the suspicious object 428, which may constitute the object itself or a file path for accessing the object for example. Although not shown, it is contemplated that the dynamic analysis may be conducted remotely from the threat detection system 150 that is handling the static analysis, such as within a cloud service 445, or any other remotely located source.

According to one embodiment of the disclosure, the dynamic analysis system 450 features one or more VMs 460 ₁-460 _(M), where each VM 460 ₁, . . . , or 460 _(M) processes the suspicious object 428 within a run-time environment. Behavior monitoring logic is configured to be operable with one or more processes running in the VM 460 ₁, . . . , or 460 _(M), where each process may be associated with a different application instance, to collect behavioral information and, in some embodiments, the behavior monitoring logic can be selectively enabled or disabled.

Illustrated in FIG. 4 as an optional feature, the dynamic analysis system 450 may include processing logic 462 that is configured to provide anticipated signaling to the VM 460 ₁-460 _(M) during processing of the suspicious object 428, and as such, represents a source of or destination for communications with the suspicious object 428 while processed within that VM 460 ₁, . . . , or 460 _(M). As an example, the processing logic 462 may be adapted to operate by providing simulated key inputs from a keyboard, keypad or touch screen or providing certain other signaling without human involvement, as requested by the suspicious object 428 during run-time.

As shown, the dynamic analysis system 450 further comprises a data store 464 and correlation logic 466. The data store 464 may be used to provide local storage for analysis and detection rules as well as operate as a local log for information accessible to the correlation logic 466 for use in determining whether the object 428 is suspicious. This information may be part of the VM-based results 475 described below.

As shown in FIG. 4, the static analysis system 430 may be adapted to provide SA-based results 470 to the classification engine 480 while the dynamic analysis system 450 may be adapted to provide the VM-based results 475 to the classification engine 480. According to one embodiment of the disclosure, the SA-based results 470 may include information associated with the characteristics of the formatted object 426 that are potentially indicative of malware (e.g., source IP address, object size, etc.). Similarly, the VM-based results 475 may include information associated with monitored behaviors of the suspicious object 428 during processing, which may include abnormal or unexpected system or API calls being invoked, abnormal or unexpected memory accesses by one or more processes running in a first VM 460 ₁.

According to one embodiment of the disclosure, the classification engine 480 is configured to receive the SA-based results 470 and/or the VM-based results 475. Based at least partially on the SA-based results 470 and/or VM-based results 475, the classification engine 480 evaluates the characteristic(s) within the SA-based results 470 and/or the content associated with the monitored behaviors that is part of the VM-based results 475 to determine whether the suspicious object 428 should be classified as “malicious”. This evaluation may be based on data acquired through experiential knowledge or machine learning.

For instance, the classification engine 480 may conduct a probabilistic modeling process that assigns risk levels to different monitored behaviors of the suspicious object 428 being processed within at least a first VM 460 ₁. The risk levels may be aggregated to produce a value (e.g., a probability score or risk designation) that denotes whether the suspicious content 428 is malicious (e.g., associated with an exploit attack). Upon determining that the object 428 is associated with a malicious attack, the classification engine 480 may provide information 490 to identify the malicious object, including information that identifies one or more of the monitored behaviors, to the reporting engine 485.

The reporting engine 485 is configured to receive information 490 from the classification engine 480 and generate alert signals 492, especially in response to the suspicious object 428 being now classified as malicious. The alert signals 492 may include various types of messages, which may include text messages, email messages, video or audio stream, or other types of information over a wired or wireless communication path. The reporting engine 485 features an optional user interface (e.g., touch pad, keyed inputs, etc.) for customization as to the reporting configuration. The reporting engine 485 may further generate signaling 494 directed to the storage (file) system via the monitoring logic to identify that the suspect object is malicious and remediate the storage of a malicious object through removal or quarantining that object.

IV. General Operational Flow

Referring to FIG. 5, an exemplary embodiment of operations conducted during the inter-connectivity between the monitoring logic and the threat detection system of FIGS. 2-4 is shown. Initially, the storage server and the threat detection system are configured (block 500). More specifically, according to one embodiment of the disclosure, the monitoring logic (plug-in) is configured to detect certain communications initiated by the storage control logic of the file system. In particular, the monitoring logic may have access to an API that provides accessibility to notification messages issued by the storage control logic within the storage server. According to this embodiment, the notification messages may operate to acknowledge completion of a requested change in state of a stored object, such as an update of a stored file, storage of a new file on the storage server, or the like.

In response to receipt of an access request message from a remotely located network device for stored content within the storage server, followed by a subsequent storage request message, the file system conducts a state change event (writes the updated file on the storage server, creates and writes a new file on the storage server, etc.) as set forth in block 510. Thereafter, the storage control logic issues a notification message that identifies that the object has been altered, which is detected by the monitoring logic (block 520). The monitoring logic extracts an object identifier from the notification message and establishes communications with the threat detection system, if communications have not yet been established (block 530).

Thereafter, the threat detection system may utilize the communication channel to obtain a copy of the stored object (file) for behavioral analysis to determine whether the stored object includes malware (block 540). In response to a determination that the stored object includes malware, the threat detection system may issue signaling to the file system to remediate the infection by quarantine or removal of the malicious suspect object within the data store of the storage server (block 550). This may involve re-locating the malicious suspect object into a certain portion of memory within the data store and substituting the suspect object with a text object. The text object, when accessed by a user through a network device, causes the display of a message that advises the user of the remediation technique conducted and perhaps information to contact an administrator for the storage server.

In the foregoing description, the invention is described with reference to specific exemplary embodiments thereof. It will, however, be evident that various modifications and changes may be made thereto without departing from the broader spirit and scope of the invention as set forth in the appended claims. 

What is claimed is:
 1. A non-transitory computer readable medium including software that, when executed by one or more processor, monitors for a triggering event causing a malware analysis to be conducted on an object and performing operations comprising: an Application Programming Interface (API) provided as an interface to a file system; and monitoring logic remotely located from the file system, the monitoring logic, when executed by the one or more processors, monitors the API to detect a notification message that is directed to a destination other than the monitoring logic and identifies a state change event representing an activity causing a change in state of a data store associated with the file system to occur, the notification message, at least in part, triggering a malware analysis to be conducted on an object associated with the state change event.
 2. The non-transitory computer readable medium of claim 1, wherein the state change event occurs in response to a requested modification of the object stored in the data store of the file system.
 3. The non-transitory computer readable medium of claim 1, wherein the state change event occurs in response to a request to store the object into the data store associated with the file system.
 4. The non-transitory computer readable medium of claim 1, wherein responsive to detecting the notification message, the triggering of the malware analysis includes extracting an identifier from the notification message, the identifier provides information to identify a storage location of the object in the data store associated with the file system.
 5. The non-transitory computer readable medium of claim 4, wherein the identifier includes a file path or a unique name assigned to the object.
 6. The non-transitory computer readable medium of claim 1, wherein the triggering of the malware analysis includes recovering the object and conducting a dynamic analysis on the object, the dynamic analysis being conducted by the dynamic analysis system that includes behavior monitoring logic configured to monitor behaviors of one or more virtual machines processing the object recovered from the data store of the file system.
 7. The non-transitory computer readable medium of claim 6, wherein the malware analysis further includes conducting a static analysis of the object by a static analysis system, the static analysis includes an analysis of at least one of (i) one or more characteristics associated with the object or (ii) information that is part of the object.
 8. The non-transitory computer readable medium of claim 1, wherein the monitoring logic being provided access to storage control logic within the file system via the API to detect the notification message responsive to the state change event being a return message for an access request message, the storage control logic controls storage and retrieval of objects from the file system.
 9. The non-transitory computer readable medium of claim 1, wherein the monitoring logic being configured to intercept the notification message where the notification message is directed to a destination other than the monitoring logic.
 10. A system comprising: one or more processors; and a memory communicatively coupled to the one or more processors, the memory including a file system including an Application Programming Interface (API) operating as an interface to the file system, and monitoring logic remotely located from the file system, the monitoring logic, when executed by the one or more processors, monitors the API to detect a notification message that is directed to a destination other than the monitoring logic and identifies a state change event representing an activity causing a change in state of a data store associated with the file system to occur, the notification message, at least in part, triggering a malware analysis to be conducted on an object associated with the state change event.
 11. The system of claim 10, wherein the state change event occurs in response to a requested modification of the object stored in the data store of the file system.
 12. The system of claim 10, wherein the state change event occurs in response to a request to store the object into the data store associated with the file system.
 13. The system of claim 10, wherein responsive to detecting the notification message, the triggering of the malware analysis includes extracting an identifier from the notification message, the identifier provides information to identify a storage location of the object in the data store associated with the file system.
 14. The system of claim 13, wherein the identifier includes a file path or a unique name assigned to the object.
 15. The system of claim 10, wherein the triggering of the malware analysis includes recovering the object and conducting a dynamic analysis on the object, the dynamic analysis being conducted by the dynamic analysis system that includes behavior monitoring logic configured to monitor behaviors of one or more virtual machines processing the object recovered from the data store of the file system.
 16. The system of claim 15, wherein the malware analysis further includes conducting a static analysis of the object by a static analysis system, the static analysis includes an analysis of at least one of (i) one or more characteristics associated with the object or (ii) information that is part of the object.
 17. The system of claim 10, wherein the monitoring logic being provided access to storage control logic within the file system via the API to detect the notification message responsive to the state change event being a return message for an access request message, the storage control logic controls storage and retrieval of objects from the file system.
 18. The system of claim 10, wherein the monitoring logic being configured to intercept the notification message where the notification message is directed to a destination other than the monitoring logic.
 19. A computerized method for monitoring a file system, including an Application Programming Interface (API) operating as an interface, by monitoring logic remotely located from the file system, comprising: monitoring the API to detect a notification message that is directed to a destination other than the monitoring logic; and identifying a state change event representing an activity causing a change in state of a data store associated with the file system to occur, wherein the notification message, at least in part, triggering a malware analysis to be conducted on an object associated with the state change event.
 20. The computerized method of claim 19, wherein the state change event occurs in response to a requested modification of the object stored in the data store of the file system.
 21. The computerized method of claim 19, wherein the state change event occurs in response to a request to store the object into the data store associated with the file system.
 22. The computerized method of claim 19, wherein responsive to detecting the notification message, the triggering of the malware analysis includes extracting an identifier from the notification message, the identifier provides information to identify a storage location of the object in the data store associated with the file system.
 23. The computerized method of claim 22, wherein the identifier includes a file path or a unique name assigned to the object.
 24. The computerized method of claim 19, wherein the triggering of the malware analysis includes recovering the object and conducting a dynamic analysis on the object, the dynamic analysis being conducted by the dynamic analysis system that includes behavior monitoring logic configured to monitor behaviors of one or more virtual machines processing the object recovered from the data store of the file system.
 25. The computerized method of claim 24, wherein the malware analysis further includes conducting a static analysis of the object by a static analysis system, the static analysis includes an analysis of at least one of (i) one or more characteristics associated with the object or (ii) information that is part of the object.
 26. The computerized method of claim 19, wherein the monitoring logic being provided access to storage control logic within the file system via the API to detect the notification message responsive to the state change event being a return message for an access request message, the storage control logic controls storage and retrieval of objects, including the object, from the file system.
 27. The computerized method of claim 19, wherein the monitoring logic being configured to intercept the notification message where the notification message is directed to a destination other than the monitoring logic. 